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The data generated from social media is very large, while the use of data 
from social media has not been fully utilized to become new knowledge. 
One of the things that can become new knowledge is user habits on social 
media. Searching for user habits on Twitter by using user tweets can be done 
by using modeling, the use of modeling lies when the data has been 


preprocessed, and the ranking will then be checked in the dictionary, this is 


where the role of the model is carried out to get a chance that the words that 
Keywords: have been ranked will perform check the word in the dictionary. The benefit 
of the model in general is to get an understanding of the mechanism in the 


ne boa problem so that it can predict events that will arise from a phenomenon 
Habitual which in this case is user habits. So that with the availability of this model, it 
Naive Bayes can be a model in getting opportunities for user habits on Twitter social 
Probability media. 
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1. INTRODUCTION 

The use of data on social media is very diverse and growing rapidly [1], [2]. Currently, social media 
produces huge amounts of data every day [3], [4]. This very large data can be used for various purposes and 
depends on what purpose the data is used for [5]. Consciously or unconsciously, Twitter social media users 
every time they tweet produce a maximum of 140 characters of letters in it, where each character forms a 
word and then forms a sentence. Sentences formed from words have their own meaning [6], to get the habits 
of users on Twitter social media, it can be seen from the frequent repeated words used by users. Where as in 
real life, activities or words that are often done are habits, habits are closely related to words that are verbs. 
Searching by utilizing available data in the world of social media is growing rapidly, but each method or 
method used by developers and researchers is different depending on the purpose and each method they do 
has advantages and disadvantages [7], [8]. 

The word habit on social media means a lot [9], where the habits of users on social media become 
new knowledge that can be used for other purposes, such as for the industrial world and other communities. 
One of the word searches uses the string match [10], [11], by utilizing the match string to get the number of 
words that are repeated from the initial data where the ranking applies according to the repetition of a word 
and is matched on a dictionary that has been labelled for look for the top-ranking word according to the 
labelled word in the dictionary. To facilitate the search, it is necessary to develop a new approach using 
mathematics where mathematics is the basic science of the computer itself. By utilizing modelling that serves 
to get an understanding or clarity of the mechanism in the problem so that it can predict events that will arise 
from a phenomenon. 
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2. MATERIAL AND METHOD 
2.1. Document 
Every document on social media can be used for research purposes. A social media document has 
many interrelated contexts, including user-provided annotations containing information [12]. The annotation 
itself contains tags [13], time, place, title and others that are closely related to user posts. From the structured 
context into a social media document, it is used as a research source, especially text data to become a very 
valuable source of information [14]. Text is also the simplest type of representation of information. Text 
documents include text document classification, grouping, topic detection, and several other processes [15]. 
Usually, the document is symbolized by D, so if there are several documents it becomes Dy, D2, 
D3... Dn. The document itself consists of a series of sentences that are interconnected with words, the word is 
symbolized by w. The number of words that may be obtained from a sentence so that it is symbolized by wi, 
W2, W3.... Wn. 


2.2. String matching 

String matching technique is a pattern search in natural language processing, text, image processing, 
pattern and speech recognition that are commonly used [16]. There will be terms that are often encountered 
in string matching, namely patterns and text. The string matching algorithm is used to match a text with other 
text [17], [18]. A simple example of string matching is: i) Pattern: Watch and ii) Text: I watch animated 
movies on TV. 


2.3. Basic probability 

It is a statistical experiment which produces only one of many possible outcomes [19], [20]. The set 
of the whole possible oucames, the sample space is symbolized withl Q. This sample space is also known as 
the set of events. Usually, a result can be denoted by o. For an event probability is defined by P(.). For 
example, an experiment about choosing a word from a text "The definition of statistical experimentation can 
be widely stated as a process". So here the sample space is: 


QO. = {a, statistical, experiment, can, be, broadly, defined, as, a, process} (1) 
The incident occurred when choosing a word with: 
@, = 4,2 = Statistical,w3 = experiment, ...,W19 = process (2) 


The total word count of the text is 10, or it is also known as the cardinality of Q. Then it can be defined that 
the probability of choosing a word, for example, “experiment”, is written P(experiment) = To OF it can be 
written as: 


pe The number of occurrences of choosing the word experiment (3) 


The total number of events that can occur 


6699 


While the probability of choosing the word “a”, P(a) is =, because the number of words “a” in the text 


is 2. The complement of Q is the incident of Q not occured and symbolized by 0° with the note that P(Q) = 
1 — P(Q°). Moreover P (D|R) defines the conditional distribution of D where R is known. Furthermore, @ 
represents the empty set, that is @= {}. Suppose D and R are two occurrences of the sample space Q, finite with 
N elements, this can be expressed as a Venn diagram as shown in Figure |. The combination of events between 
D and R, can be described by D U R, where either event D or R or both occur [21]. 

In case of D/O R, the intersection of occurrence D and R [22]. Here, they are considered mutually 
exclusive when the matter of one event prevents another’s event as shown in Figure 2. If the object is in 
ellipse R, what is the probability that the object is also in D. In order to be in D, the object must also be in 
slice DM R. Therefore, the probability is equal to the number of components in |D M RI, split by those in R, 
i.e., [R|. Officially the pattern is as (4): 

_ [DAR] _ |DARI/|Q| _ P(@aR) 


= = (4) 


~ ARI IRI/|Q\ P(R) 


P(D|R) 


2.4. Architecture research 

To be directed and precise according to the rules, it is limited by general architectural research as 
shown in Figure 3. So, the discussion will be more focused on what the author wants to get. From the figure, 
the following steps are obtained: 
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i) Initial data from Twitter that is ready for use, where the data has been pre-processed previously [19], 
where pre-processing follows from the purpose of this study. 

ii) The preprocessing data is then ranked by word [23], where the assumption is that the word with repeated 
frequency is a word that refers to the habits of Twitter social media users. 

iii) The results of this ranking will be checked by the campus that has been prepared, normally checking is 
done using string matching. 

iv) Here the research environment plays a role between word ranking activities and checking the dictionary, 
where a new model is made to get opportunities for user habits on Twitter social media. 


Q Di Ds Da 
one D2 Ds 


Figure 1. Sum of 2 events Figure 2. Mutually exclusive 


GOAL 
New 
Approach 


Data for 
Analysis 
from Twitter 

i Check 


Word Ranking —>+ Dictionary 


Figure 3. Architecture research 


3. RESULTS AND DISCUSSION 
3.1. Total probability theorem 

The total probability theorem should be first comprehended before starting to discuss about Bayes' 
theorem, theorem [24]. Starting with the regulation to add two events A and B, it is clear form Figure | and 
Figure 2 that the (5). Suppose that the sample space is subdivided in n independent occurrence Dj, i=1.n, as 
seen in Figure 3. In Figure 3, it can be conluded if DR is expressed by R = (RM D1) U (RN D2) U (RN D3) U 
(RM Da) U (RN Ds) UU (RN D,) so that the total probability theorem can be obtained as: 


P(DU R) = P(D) + P(R) — P(N R) (5) 

=Y7, P(RID))P(D,) (6) 
becomes 

P(R) = P(RID) P(D) + P(R|Dc)P(De) (7) 


as D2 U D3 U... U Dn is the complement of D1. Bayes’ theorem [25] can be described: 
Suppose |D| # 0 dan |R| # 0 The following conditions can be stated: 


|DNR|_ P(DNR) 


_ |RND| = P(RND) 
P(R|D) = = (9) 


where the press (8) and (9) is clear as: 


P(D NR) = P(D|R)P(R) = P(R|D)P(D) (10) 
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P(R|D)P() 
P(R) 


P(D|R) = (11) 


When the sample © space can be split into a finite number of independent occurrences D;, D2, D3, Da, Ds 
...D,, and when R constitute an occurrence with P(R)>0, that is the combined subset of all D;, then for each 
Dj, then for each Di, Bayes formula which can be generalized as: 


P(RID;) Pa 


P(D,IR) = ae P(R|D; Py) 


(12) 


In (12) results from (11) due to the total probability theorem (6) and (7). With the recognized observational 
data, Bayes' theorem may be used to calculate the posterior probability of a hypothesis. 


3.2. Naive Bayes classification 

Naive Bayesian learning adverts to the formation of a Bayesian probability model, which applies the 
posterior class of probability to a case: P(Y=yX = xi). Naive Bayes classifier [26] applies this probability to 
give a case to a class. Implement Bayes' Theorem (11) and simplify the notation, it would be obtained: 


P xi|¥j POs) 


POyba) =2 aa (13) 


Where the numerator in the (13) is the combined probability of x; and y; (10). As a result, the denominator 
can be changed into: only use x, omit the index i for simplify: 


P(aly))P OF) = P& yj) = P(1|x2,x3,..., xp, yj) P(x2|x3,x4,...,xp, yj) P(xplys) PO) 


Suppose the individual xi is not dependent one another. This is a strong hypothesis that clearly is against 
practical application, and thus Naive-as the suggested name. This supposition leads to P(xi|x2, x3, ..., Xp yj) = 
P (xily;). So, the combined probability of x and y; is: 


P(x|yj) = Meas P(e ly) POW) (14) 
which can be entered into press (13), so that there are: 


Me=1P(XK|¥;)POs) 
P(yj|x) = oe (sel) = (15) 


It should be noted that the denominator, P(x), has nothing to do with the category, for example for 
the categories y; and y, are the same. P(x) works to be a scale factor and convinces that the posterior 
probability P(yj|x) is appropriately scaled. If we focus in clear classification rules, that is, to accurately assign 
each case to a class, we only need to calculate the numerator of each class and choose the maximum value of 
this value. The regulation is called the posterior maximum rule (16). The result class is also called the 
posterior maximum class (MAP), for the case of x it is calculated as y*: 


y = argmaxy, Te P(x«ly,)P(y) (16) 


The maximum likelihood probability of a word belonging to a certain category is set by (17): 


Number of words x; in class c document 


P(x;lc) = (17) 


Total number of words in class c document 
According to Bayes' rule, the probability which a particular document corresponds to class c; is given by: 


P(d|c;)P(c) 


P(eyld) = AE 


(18) 
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If we use the assumption of conditional free simple form, that we know a class, the words are conditionally 
independent of each other. Because of this assumption, the model is called Naive. 


T1P(x;|cj)Pce;) 


P(e;|4) = Sth 


(19) 


Here xi is a word from the document. Classifier returns the class with the maximum posterior probability. 


3.3. User 

To determine what is the probability that the selected word is a verb. So, to determine the probability 
of the verb obtained in the dictionary, it is necessary to first describe the following things. For users, do the 
following steps to get a new model, as: i) number of users: U; ii) with U: {b,, bo, ..., by, }; 111) average number of 
texts user: T; iv) with T = tP1, tP2, 0. tPu: v) the meaning is t1is the text from user b,, t»2is the text from 
user bj; vi) average number of words per user: W; vii) with W = wt, w2,..., wee; viii) average number of 
verbs from the dictionary: V; ix) so, for the whole community; x) the number of texts from all users is T.U; xi) 
the number of words for all users is W.U; xii) the order of occurrence of the word R; xiii) choose a word from 
the set W taken from the set of texts T from community U; xiv) the problem is to determine what is the 
probability that this selected word is a verb that belongs to the R occurrence ranking; and xv) the probability 
of choosing a word from the set W from the entire text is: 


Pw |r) = (20) 


Take the class t”/ where j=1,2,..., u is the text class that comes from user j € U. The probability 
that the class t”/ given that the word w”? \ is in that class can be revised in conditional probability as: 


p(wi|t?s)pces 


P(t?i|w%) = Pweiy 


(21) 


In this equation P(wi|t?s ) = = (from the Press. 20). The probability of choosing class t?/ is = While the 


probability of choosing the word w”? is = 


bil..bs ee eu) Vp 
P(t i|w i) = =i 
Ww 


Wey) w 


yi 


or = 


WU is the total number of words from all users, TU total number of texts from all users, W total 
number (words)/user, T total number (texts)/user. Now we want to determine that the selected word is a verb. 
The number of verbs in the dictionary is V. The probability of choosing a word wi knowing that it is a verb, 
is written as: 


P(w|v) = <= (22) 
and the probability that the word w” is contained in the V verb dictionary. 


P(wi|V)P(V) 


ECW aaa 


Z MU *y 
Why 


_ (wuyw 
= 
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So now what we want to determine is the probability that the word w”? \ is included in the text of T 
and is included in the verb dictionary V. In other words, the word w”: is a verb. 


P(w|V,T) = PV|w®)P(T|w) 
P(V|w”) 


wiv 


Is the probability of selecting a verb from the user b;. That is W, P(T|w”%). It is the probability that the 


y2 
a WERE a oa it 
selected word from user b; comes from the text. That is — TE With this new model, we can get a new approach 


to find opportunities for user habits on Twitter social media, making it easier for us to analyze user habits. 


4. CONCLUSION 

User habits on Twitter social media can be used for new knowledge. The same as getting a model by 
getting a mathematical model to look for word opportunities in the text that are included in the verb 
dictionary to be able to get users' habits on Twitter social media. Where is very helpful in terms of finding 


bj wi 


user habits. The design of the new model — 7 can be used to find user habits. In the future, with this 


model, users can get habits, so that it is useful for other things such as in the field of product promotion, 
communities that have the same habits and others. 
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